Picture for Ruichuan An

Ruichuan An

Agent Skills Should Go Beyond Text: The Case for Visual Skills

Add code
May 31, 2026
Viaarxiv icon

Rethinking VLM Representation for VLA Initialization

Add code
May 25, 2026
Viaarxiv icon

VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction

Add code
May 14, 2026
Viaarxiv icon

Uni-Synergy: Bridging Understanding and Generation for Personalized Reasoning via Co-operative Reinforcement Learning

Add code
May 11, 2026
Viaarxiv icon

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Add code
Apr 06, 2026
Viaarxiv icon

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Add code
Mar 27, 2026
Viaarxiv icon

PEARL: Personalized Streaming Video Understanding Model

Add code
Mar 20, 2026
Viaarxiv icon

MME-CoF-Pro: Evaluating Reasoning Coherence in Video Generative Models with Text and Visual Hints

Add code
Mar 20, 2026
Viaarxiv icon

GENIUS: Generative Fluid Intelligence Evaluation Suite

Add code
Feb 11, 2026
Viaarxiv icon

GEBench: Benchmarking Image Generation Models as GUI Environments

Add code
Feb 09, 2026
Viaarxiv icon